MAISE: A Flexible, Configurable, Extensible Open Source Package for Mass AI System Evaluation

Author

  • Omar Zaidan
Abstract

The past few years have seen increasing interest in using Amazon’s Mechanical Turk to collect data and perform annotation tasks. One such use is the mass evaluation of system output across a variety of tasks. In this paper, we present MAISE, a package that allows researchers to evaluate the output of their AI system(s) using human judgments collected via Amazon’s Mechanical Turk, greatly streamlining the process. MAISE is open source, easy to run, and platform-independent. The core of MAISE’s codebase was used for the manual evaluation of WMT10, and the completed package is being used again in the current evaluation for WMT11. We describe the main features, functionality, and usage of MAISE, which is now available for download and use.

Similar resources

Comparison of Open Source Learning Management Softwares and Presenting a Native Evaluation Tool

Introduction: Nowadays all educational institutes are trying to use technology in their structure. This effort has been faced with different barriers, including cost, time, and support. Therefore, using open source softwares can partially help us in using technology. In this article, we review main features of several open source learning management softwares, while presenting a tool which incl...

3D Finite element modeling for Dynamic Behavior Evaluation of Marin Risers Due to VIV and Internal Flow

The complete 3D nonlinear dynamic problem of extensible, flexible risers conveying fluid is considered. For describing the dynamics of the system, the Newtonian derivation procedure is followed. The velocity field inside the pipe formulated using hydrostatic and Bernoulli equations. The hydrodynamic effects of external fluids are taken into consideration through the nonlinear drag forces in var...

Sushi.R: flexible, quantitative and integrative genomic visualizations for publication-quality multi-panel figures

MOTIVATION Interpretation and communication of genomic data require flexible and quantitative tools to analyze and visualize diverse data types, and yet, a comprehensive tool to display all common genomic data types in publication quality figures does not exist to date. To address this shortcoming, we present Sushi.R, an R/Bioconductor package that allows flexible integration of genomic visuali...

Using Computer Games Techniques for Improving Graph Visualization Efficiency

Creating an efficient, interactive and flexible unified graph visualization system is a difficult problem. We present a hardware accelerated OpenGL graph drawing engine, in conjunction with a flexible preview package. While the interactive OpenGL visualization focuses on performance, the preview focuses on aesthetics and simple network map creation. The system is implemented as Gephi, a modular...

An Open-Source Package for Recognizing Textual Entailment

This paper presents a general-purpose open source package for recognizing Textual Entailment. The system implements a collection of algorithms, providing a configurable framework to quickly set up a working environment to experiment with the RTE task. Fast prototyping of new solutions is also allowed by the possibility to extend its modular architecture. We present the tool as a useful resource...

Publication date: 2011